Rank | Count | Beginning |
---|---|---|
8551 | 8808 | આ |
48133 | 1812 | તે |
42137 | 864 | જો |
48460 | 855 | તેઓ |
53085 | 741 | તેમણે |
24123 | 678 | એક |
44009 | 646 | જ્યારે |
40101 | 612 | જે |
4768 | 590 | અને |
50715 | 564 | તેના |
16313 | 487 | આમ |
5497 | 478 | અન્ય |
7406 | 463 | અહીં |
53837 | 432 | તેમના |
51438 | 421 | તેની |
42207 | 418 | જોકે |
42210 | 369 | જોકે, |
63806 | 368 | પરંતુ |
55131 | 352 | તેમાં |
50044 | 350 | તેણે |
30809 | 345 | કેટલાક |
54321 | 340 | તેમની |
24034 | 322 | એ |
41395 | 322 | જેમાં |
52066 | 301 | તેને |
79349 | 285 | મોટા |
56483 | 282 | ત્યાર |
93189 | 260 | સામાન્ય |
86155 | 247 | વર્ષ |
36199 | 229 | ઘણા |
The next four subsections show the most frequent sentence beginnings consisting of N words, for N = 1, 2, 3, 4. This subsection starts with N = 1.
The most frequent word-N-grams at the beginning of sentences give some insight into sentence composition.
For N = 1 in particular, even a small corpus is enough to identify the most frequent sentence beginnings.
select substring_index(sentence, ' ', 1) as beg, count(*) as cnt from sentences group by substring_index(sentence, ' ', 1) order by cnt desc limit 50;
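The same count can be reproduced outside the database. The following is a minimal sketch in Python, not the pipeline actually used for this page. It assumes the sentences are available as a list of strings with words separated by single spaces; the function name `sentence_beginnings` and its parameters are hypothetical.

```python
from collections import Counter


def sentence_beginnings(sentences, n=1, limit=50):
    """Return the `limit` most frequent n-word sentence beginnings.

    Mirrors substring_index(sentence, ' ', n) from the query above:
    the text before the n-th space is taken as the beginning, and a
    sentence with fewer than n words is counted as a whole.
    Assumption: `sentences` is an iterable of plain strings.
    """
    counts = Counter(" ".join(s.split(" ")[:n]) for s in sentences)
    # most_common sorts by descending count, like "order by cnt desc".
    return counts.most_common(limit)
```

Calling `sentence_beginnings(sentences, n=2)` would give the counts for two-word beginnings, which corresponds to changing the third argument of `substring_index` in the query.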